PF-0515-1 CON 



SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Hillman, Jennifer L. 

Tang, Y. Tom 
Corley, Neil C. 
Guegler, Karl J. 
Yue, Henry
Patterson, Chandra 

(ii) TITLE OF THE INVENTION: PROTEINS ASSOCIATED WITH EGF SIGNALING 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Genomics, Inc. 

(B) STREET: 3160 Porter Dr. 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/069,725 

(B) FILING DATE: April 29, 1998 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE/DOCKET NUMBER: PF-0515-1 CON

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-855-0555 

(B) TELEFAX: 650-845-4166



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 298 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: COLNNOT16

(B) CLONE: 1281694 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Met Asn Asp Ser Leu Arg Thr Asp Val Phe Val Arg Phe Gln Pro Glu
1               5                   10                  15

Ser Ile Ala Cys Ala Cys Ile Tyr Leu Ala Ala Arg Thr Leu Glu Ile
            20                  25                  30

Pro Leu Pro Asn Arg Pro His Trp Phe Leu Leu Phe Gly Ala Thr Glu
        35                  40                  45

Glu Glu Ile Gln Glu Ile Cys Leu Lys Ile Leu Gln Leu Tyr Ala Arg
    50                  55                  60

Lys Lys Val Asp Leu Thr His Leu Glu Gly Glu Val Glu Lys Arg Lys
65                  70                  75                  80

His Ala Ile Glu Glu Ala Lys Ala Gln Ala Arg Gly Leu Leu Pro Gly
                85                  90                  95

Gly Thr Gln Val Leu Asp Gly Thr Ser Gly Phe Ser Pro Ala Pro Lys
            100                 105                 110

Leu Val Glu Ser Pro Lys Glu Gly Lys Gly Ser Lys Pro Ser Pro Leu
        115                 120                 125

Ser Val Lys Asn Thr Lys Arg Arg Leu Glu Gly Ala Lys Lys Ala Lys
    130                 135                 140

Ala Asp Ser Pro Val Asn Gly Leu Pro Lys Gly Arg Glu Ser Arg Ser
145                 150                 155                 160

Arg Ser Arg Ser Arg Glu Gln Ser Tyr Ser Arg Ser Pro Ser Arg Ser
                165                 170                 175

Ala Ser Pro Lys Arg Arg Lys Ser Asp Ser Gly Ser Thr Ser Gly Gly
            180                 185                 190

Ser Lys Ser Gln Ser Arg Ser Arg Ser Arg Ser Asp Ser Pro Pro Arg
        195                 200                 205

Gln Ala Pro Arg Ser Ala Pro Tyr Lys Gly Ser Glu Ile Arg Gly Ser
    210                 215                 220

Arg Lys Ser Lys Asp Cys Lys Tyr Pro Gln Lys Pro His Lys Ser Arg
225                 230                 235                 240

Ser Arg Ser Ser Ser Arg Ser Arg Ser Arg Ser Arg Glu Arg Ala Asp
                245                 250                 255

Asn Pro Gly Lys Tyr Lys Lys Lys Ser His Tyr Tyr Arg Asp Gln Arg
            260                 265                 270

Arg Glu Arg Ser Arg Ser Tyr Glu Arg Thr Gly Arg Arg Tyr Glu Arg
        275                 280                 285

Asp His Pro Gly His Ser Arg His Arg Arg
    290                 295



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: COLNNOT16

(B) CLONE: 1281694 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

TTCGTGAAGC ACTCCATGGA GCATGTGTCA ATGGCCTGTG TCCACCTGGC TTCCAAGATA 60 

GAAGAGGCCC CAAGACGCAT ACGGGACGTC ATCAATGTGT TTCACCGCCT TCGACAGCTG 120

AGAGACAAAA AATAATCGTT ATGTACCTTC AGGTGTTAGA GTGTGAGCGT AACCAACACC 180 

TGGTCCAGAC CTCATGGGTA GCCTCTGAGG GTAAGTGACT AAGACTTCTC CTCTGCTGTC 240 

CAAGCGCTTT GGTGCAGGGA CAGCGGCATC TTCAGCCAAT CCAGTGCAGG CTCTCCACCG 300 

AAGGCTGGCT CTAGACTGGT GACCCCTTGT TGAAATGGGA CAGTTGGCAG CGGCTCTGAT 360

GAGCCCGAGA AGAGGCCTGC CCTTGGGTGC GGAGTCTCCC TCCGCACGAT GCTCCCACGC 420

GTCCAACTTG CACCCAAGGG GCTTTTCCCT CTTCCAAGTG GACTCCTTCA AGGAAGCTGC 480

AGCTCGGTCA GCAGAGAAGG GGCCTGCCGC CAGCGCCCTG GAGGAAGAGG AAGAGGAACC 540 

CAAGAGGATG GCTTGTCTCC CAGCAGCCAC ACCGGCTTTG TGCTCAGCCA GTTCATTTGA 600 

GTTTGCATGT TTCTCTGCAC TATGGATTTT GAGCATTTAG ATTTCTTTAA TCAAAAGCGT 660 

TTTAGTGACT CCAGTAGACA TTTTCTTTCT GAGGCATCGT GCTTTGCATG AGAGCAGGCC 720

AAGGTTGAGG GGAAAAGTAA AGTTAAAGTC GGTTCTCTTT CATAGCAACA CGTATTGTCT 780 

GACATTCAGC CAGCTTTTTT TTTTTCTAAT AATTTCTGTG CCTTTCTGTC CTGTATTTAC 840 

TGTATTTAGA AAAAGCAGCT AGAATATTTC TCCATTAACT CTTGAGATTC ACAGGACTGT 900

CTAGCTCTGA GTCCTAGCAA TAGACTCCTT AGAGGAGTAG TACGTTTATC TAGATTTTCT 960

CTAGATAATG CAGGCGGAAG ACCTGGGTTC CCGGGTGGGG CATTGCAGTT CTTCCTGTGT 1020

TTGGCTTCCA GGAATTACAT GAACGACAGC CTTCGCACCG ACGTCTTCGT GCGGTTCCAG 1080 

CCAGAGAGCA TCGCCTGTGC CTGCATTTAT CTTGCTGCCC GGACGCTGGA GATCCCTTTG 1140 

CCCAATCGTC CCCATTGGTT TCTTTTGTTT GGAGCAACTG AAGAAGAAAT TCAGGAAATC 1200 

TGCTTAAAGA TCTTGCAGCT TTATGCTCGG AAAAAGGTTG ATCTCACACA CCTGGAGGGT 1260 

GAAGTGGAAA AAAGAAAGCA CGCTATCGAA GAGGCAAAGG CCCAAGCCCG GGGCCTGTTG 1320

CCTGGGGGCA CACAGGTGCT GGATGGTACC TCGGGGTTCT CTCCTGCCCC CAAGCTGGTG 1380 

GAATCCCCCA AAGAAGGTAA AGGGAGCAAG CCTTCCCCAC TGTCTGTGAA GAACACCAAG 1440 

AGGAGGCTGG AGGGCGCCAA GAAAGCCAAG GCGGACAGCC CCGTGAACGG CTTGCCAAAG 1500 

GGGCGAGAGA GTCGGAGTCG GAGCCGGAGC CGTGAGCAGA GCTACTCGAG GTCCCCATCC 1560 

CGATCAGCGT CTCCTAAGAG GAGGAAAAGT GACAGCGGCT CCACATCTGG TGGGTCCAAG 1620 

TCGCAGAGCC GCTCCCGGAG CAGGAGTGAC TCCCCACCGA GACAGGCCCC CCGCAGCGCT 1680 

CCCTACAAAG GCTCTGAGAT TCGGGGCTCC CGGAAGTCCA AGGACTGCAA GTACCCCCAG 1740 

AAGCCACACA AGTCTCGGAG CCGGAGTTCT TCCCGTTCTC GAAGCAGGTC ACGGGAGCGG 1800 

GCGGATAATC CGGGAAAATA CAAGAAGAAA AGTCATTACT ACAGAGATCA GCGACGAGAG 1860 

CGCTCGAGGT CGTATGAACG CACAGGCCGT CGCTATGAGC GGGACCACCC TGGGCACAGC 1920

AGGCATCGGA GGTGAGGCGG GGTTGCAGTG ACTGGTGGCC GCAAGCCCTT CCCTGGGGAG 1980 

TACCTGATGG CTGCCCTTTG ACCCCCGGTG GCTGCCCTTT GACCCCCGGG TGTGCTCTCA 2040

GCGCAAGTGG TCCTAGAACA GGATTCTTTT TGGAAATGTC TGTCGACTGG ACCTTGGTGG 2100 

ATTTGGAAAT GGAACTGAGG GACCGGTGAC ACGTGCTTCA GACCGGTCTG GGGTGCGGCG 2160

CACACCTGGG CCCGTGCAGG GCTCAGCTCG GCAGCAGCTC TGAGGGCAGC TCAATGAAAA 2220

AGTGAATGCA CACGCCCTTG TTGGCGTGGC CTGGCATGGC CTGGTGCTAT CGGCAGCCGC 2280 

TCTCCACTCC CCGACTGATA CTCAATTACG TGAAGCCAAG AAAGATGATT TTTAGAACCT 2340

TTGCCTATAT TAGGTTGTAC TTATGTACAT ATTTTGCAGT GTTTCACAGG AGAAAGTGGC 2400 

CTTAACTGCC CCTTATTCTC TCTCCACGTT GTAAATAAAC ATGTGTTTAA TACAAGTTAA 2460 

AGCTATGTAT GAAAACTCAG AACTTGAATC CCGTCAGCTT AAAACTTGTG TAGGGAATCC 2520

TGACTTTTAA AATGTGAGGG TATTTGGATC TGTGTTGAAA GTCGTATATT TTTATCTGTG 2580

CGGTGCTGAG TGCAGGCCAC CAGCTCCTAA ATAGAGGTTC CCTATATGCG CGTATGACAT 2640

GGTGAATAAA CACAACTCTC TCCACTCAGG ACATCCGGAG CGTTATGGAC GTGGTAGGTG 2700 

GTCGTTCTGT GTGCTTGTGA AAGTGTCCAG GCGTGTGCAC AGCCAGTGCG CCCACTTCCG 2760

GGCTCCTTGC TCCCTGCTGT ACTGAAGTTT TGGATTTTGC ATCCAATCCT GTGTGCCTGC 2820

CCTTCTGCCG AAGCTTGTGA GGGGCCTGAG TCCTCTGCCC ATCAGGATGA CAGGCTCCTT 2880

CCTGCAGGGC CATAGGAGGG AAGTTTTGGA AACACAGAAT GATTCCAAGG TGCTCTCGTT 2940 

CCTGAGGGGG ACTGGTTTGT AACCCATGAC ATCTGTGGGC GAGAGAGGCA GCTGGGAGCA 3000

GGACACTTGG AGGGTCACCC CACGGGGGTG GCACCTGCAC TCTGAGTGCC CCCCACTGTC 3060

ATCAGCTGCC TCTTACCGTG GACACAGTTT TGGTTTTGGG GACTAGGGGG CCCCACTCCT 3120 

GGTGGTACCG TTTGGACTTA CTAGGGCAGT GGGACATATA GGCCGGGGCT AGTGGGATAA 3180 

CGGGGAGTTA CGCCTGATGA CTTTTTTGAT GGAATCCTGC ATTAGATAGC TGGTGGGACC 3240

CCCCCCTCAG AATTGGGGAA CTGAGGAGAC TCCAGGGAGG GTGTCCTTCC AGGGAGAGCA 3300

GCTATGAGGG GCCCCCTAGC TTCCTGTGCC TGGAAGTAAG AGAACCAGTA AAGGGCCATA 3360

CACACCTGTA CCCAAGAGAC CGCTCTCCAT TTGCTTTCTT TTTTTACTAA ATAATTGTAA 3420 

AATATTATTA TGACATAAAG AACCATTTAA GGCCAAAAAA AAAA 3464



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BEPINOT01 

(B) CLONE: 2056178 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



Met Tyr Ser Ala Gln Arg Phe Trp Gly Thr Ile Trp Ala Arg Arg Gly
1               5                   10                  15

Ala His Leu Ala Pro Pro Asp Ala Ser Ile Leu Ile Ser Asn Val Cys
            20                  25                  30

Ser Ile Gly Asp His Val Ala Gln Glu Leu Phe Gln Gly Ser Asp Leu
        35                  40                  45

Gly Met Ala Glu Glu Ala Glu Arg Pro Gly Glu Lys Ala Gly Gln His
    50                  55                  60

Ser Pro Leu Arg Glu Glu His Val Thr Cys Val Gln Ser Ile Leu Asp
65                  70                  75                  80

Glu Phe Leu Gln Thr Tyr Gly Ser Leu Ile Pro Leu Ser Thr Asp Glu
                85                  90                  95

Val Val Glu Lys Leu Glu Asp Ile Phe Gln Gln Glu Phe Ser Thr Pro
            100                 105                 110

Ser Arg Lys Gly Leu Val Leu Gln Leu Ile Gln Ser Tyr Gln Arg Met
        115                 120                 125

Pro Gly Asn Ala Met Val Arg Gly Phe Arg Val Ala Tyr Lys Arg His
    130                 135                 140

Val Leu Thr Met Asp Asp Leu Gly Thr Leu Tyr Gly Gln Asn Trp Leu
145                 150                 155                 160

Asn Asp Gln Val Met Asn Met Tyr Gly Asp Leu Val Met Asp Thr Val
                165                 170                 175

Pro Glu Lys Val His Phe Phe Asn Ser Phe Phe Tyr Asp Lys Leu Arg
            180                 185                 190

Thr Lys Gly Tyr Asp Gly Val Lys Arg Trp Thr Lys Asn Val Asp Ile
        195                 200                 205

Phe Asn Lys Glu Leu Leu Leu Ile Pro Ile His Leu Glu Val His Trp
    210                 215                 220

Ser Leu Ile Ser Val Asp Val Arg Arg Arg Thr Ile Thr Tyr Phe Asp
225                 230                 235                 240

Ser Gln Arg Thr Leu Asn Arg Arg Cys Pro Lys His Ile Ala Lys Tyr
                245                 250                 255

Leu Gln Ala Glu Ala Val Lys Lys Asp Arg Leu Asp Phe His Gln Gly
            260                 265                 270

Trp Lys Gly Tyr Phe Lys Met Asn Val Ala Arg Gln Asn Asn Asp Ser
        275                 280                 285

Asp Cys Gly Ala Phe Val Leu Gln Tyr Cys Lys His Leu Ala Leu Ser
    290                 295                 300

Gln Pro Phe Ser Phe Thr Gln Gln Asp Met Pro Lys Leu Arg Arg Gln
305                 310                 315                 320

Ile Tyr Lys Glu Leu Cys His Cys Lys Leu Thr Val
                325                 330



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1991 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: BEPINOT01

(B) CLONE: 2056178 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GCCTCCCTGT CCCCCGACCC TCTTTTGATG CCTCAGCAAG TGAAGAGGAG GAAGAAGAGG 60 

AGGAGGAGGA GGATGAAGAT GAAGAGGAGG AAGTGGCAGC TTGGAGGCTG CCCCCAAGAT 120 

GGAGTCAGCT GGGAACCTCC CAGCGGCCCC GCCCTTCCCG CCCCACTCAT CGAAAAACCT 180 

GCTCACAGCG CCGCCGCCGA GCCATGAGAG CCTTCCGGAT GCTGCTCTAC TCAAAAAGCA 240

CCTCGCTGAC ATTCCACTGG AAGCTTTGGG GGCGCCACCG GGGCCGGCGG CGGGGCCTCG 300

CACACCCCAA GAACCATCTT TCACCCCAGC AAGGGGGTGC GACGCCACAG GTGCCATCCC 360

CCTGTTGTCG TTTTGACTCC CCCCGGGGGC CACCTCCACC CCGGCTGGGT CTGCTAGGTG 420

CTCTCATGGC TGAGGATGGG GTGAGAGGGT CTCCACCAGT GCCCTCTGGG CCCCCCATGG 480 

AGGAAGATGG ACTCAGGTGG ACTCCAAAGT CTCCTCTGGA CCCTGACTCG GGCCTCCTTT 540 

CATGTACTCT GCCCAACGGT TTTGGGGGAC AATCTGGGCC AGAAGGGGAG CGCACTTGGC 600

ACCCCCTGAT GCCAGCATCC TCATCAGCAA TGTGTGCAGC ATCGGGGACC ATGTGGCCCA 660 

GGAGCTTTTT CAGGGCTCAG ATTTGGGCAT GGCAGAAGAG GCAGAGAGGC CTGGGGAGAA 720 

AGCCGGCCAG CACAGCCCCC TGCGAGAGGA GCATGTGACC TGCGTACAGA GCATCTTGGA 780

CGAATTCCTT CAAACGTATG GCAGCCTCAT ACCCCTCAGC ACTGATGAGG TAGTAGAGAA 840 

GCTGGAGGAC ATTTTCCAGC AGGAGTTTTC CACCCCTTCC AGGAAGGGCC TGGTGTTGCA 900

GCTGATCCAG TCTTACCAGC GGATGCCAGG CAATGCCATG GTGAGGGGCT TCCGAGTGGC 960

TTATAAGCGG CACGTGCTGA CCATGGATGA CTTGGGGACC TTGTATGGAC AGAACTGGCT 1020

CAATGACCAG GTGATGAACA TGTATGGAGA CCTGGTCATG GACACAGTCC CTGAAAAGGT 1080 

GCATTTCTTC AATAGTTTCT TCTATGATAA ACTCCGTACC AAGGGTTATG ATGGGGTGAA 1140 

AAGGTGGACC AAAAACGTGG ACATCTTCAA TAAGGAGCTA CTGCTAATCC CCATCCACCT 1200

GGAGGTGCAT TGGTCCCTCA TCTCTGTTGA TGTGAGGCGA CGCACCATCA CCTATTTTGA 1260

CTCGCAGCGT ACCCTAAACC GCCGCTGCCC TAAGCATATT GCCAAGTATC TACAGGCAGA 1320

GGCGGTAAAG AAAGACCGAC TGGATTTCCA CCAGGGCTGG AAAGGTTACT TCAAAATGAA 1380

TGTGGCCAGG CAGAATAATG ACAGTGACTG TGGTGCTTTT GTGTTGCAGT ACTGCAAGCA 1440 

TCTGGCCCTG TCTCAGCCAT TCAGCTTCAC CCAGCAGGAC ATGCCCAAAC TTCGTCGGCA 1500 

GATCTACAAG GAGCTGTGTC ACTGCAAACT CACTGTGTGA GCCTCGTACC CCAGACCCCA 1560

AGCCCATAAA TGGGAAGGGA GACATGGGAG TCCCTTCCCA AGAAACTCCA GTTCCTTTCC 1620

TCTCTTGCCT CTTCCCACTC ACTTCCCTTT GGTTTTTCAT ATTTAAATGT TTCAATTTCT 1680

GTATTTTTTT TTCTTTGAGA GAATACTTGT TGATTTCTGA TGTGCAGGGG GTGGCTACAG 1740

AAAAGCCCCT TTCTTCCTCT GTTTGCAGGG GAGTGTGGCC CTGTGGCCTG GGTGGAGCAG 1800 

TCATCCTCCC CCTTCCCCGT GCAGGGAGCA GGAAATCAGT GCTGGGGGTG GTGGGCGGAC 1860 

AATAGGATCA CTGCCTGCCA GATCTTCAAA CTTTTATATA TATATATATA TATATATATA 1920

TATATATATA TATATATATA TATATATATA AATATATAAA TGCCACGGTC CTGCTCTGGT 1980 

CAATAAAGAT C 1991 



(2) INFORMATION FOR SEQ ID NO: 5:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 1276645



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Met Glu Gly Pro Lys Lys Ala Gln Gly His Ser Pro Val Asn Gly Leu
1               5                   10                  15

Leu Lys Gly Gln Glu Ser Arg Ser Gln Ser Arg Ser Arg Glu Gln Ser
            20                  25                  30

Tyr Ser Arg Ser Pro Ser Arg Ser Ala Ser Pro Lys Arg Arg Lys Ser
        35                  40                  45

Asp Ser Gly Ser Thr Ser Gly Gly Ser Lys Ser Gln Ser Arg Ser Arg
    50                  55                  60

Ser Arg Ser Asp Ser Pro Pro Arg Gln Val His Arg Gly Ala Pro Tyr
65                  70                  75                  80


Lys Gly Ser Glu Val Arg Gly Ser Arg Lys Ser Lys Asp Cys Lys Tyr
                85                  90                  95

Leu Thr Gln                         Ser Arg Ser Ser     Arg Ser
            100                 105                 110

Arg Ser Arg Ser Arg Glu Arg Thr Asp Asn Ser Gly Lys Tyr Lys Lys
        115                 120                 125

Lys     His Tyr Tyr Arg Asp Gln Arg Arg Glu Arg Ser Arg Ser Tyr
    130                 135                 140

Glu Arg Thr Gly His Arg Tyr Glu Arg Asp His Pro Gly His Ser Arg
145                 150                 155                 160

His Arg Arg Cys Asp Arg Ile Ser Gly Gly Cys Pro Trp Ser Leu Pro
                165                 170                 175

Val Gly His Thr Leu Ala Ser Val Ala Leu
            180                 185